Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 1.430
Filter
1.
JMIR Public Health Surveill ; 10: e51880, 2024 Apr 24.
Article in English | MEDLINE | ID: mdl-38656780

ABSTRACT

During public health crises, the significance of rapid data sharing cannot be overstated. In attempts to accelerate COVID-19 pandemic responses, discussions within society and scholarly research have focused on data sharing among health care providers, across government departments at different levels, and on an international scale. A lesser-addressed yet equally important approach to sharing data during the COVID-19 pandemic and other crises involves cross-sector collaboration between government entities and academic researchers. Specifically, this refers to dedicated projects in which a government entity shares public health data with an academic research team for data analysis to receive data insights to inform policy. In this viewpoint, we identify and outline documented data sharing challenges in the context of COVID-19 and other public health crises, as well as broader crisis scenarios encompassing natural disasters and humanitarian emergencies. We then argue that government-academic data collaborations have the potential to alleviate these challenges, which should place them at the forefront of future research attention. In particular, for researchers, data collaborations with government entities should be considered part of the social infrastructure that bolsters their research efforts toward public health crisis response. Looking ahead, we propose a shift from ad hoc, intermittent collaborations to cultivating robust and enduring partnerships. Thus, we need to move beyond viewing government-academic data interactions as 1-time sharing events. Additionally, given the scarcity of scholarly exploration in this domain, we advocate for further investigation into the real-world practices and experiences related to sharing data from government sources with researchers during public health crises.


Subject(s)
COVID-19 , Information Dissemination , Public Health , Humans , COVID-19/epidemiology , Public Health/trends , Information Dissemination/methods , Government , Pandemics
2.
PeerJ Comput Sci ; 10: e1962, 2024.
Article in English | MEDLINE | ID: mdl-38660153

ABSTRACT

Data sharing is increasingly important across various industries. However, issues such as data integrity verification during sharing, encryption key leakage, and difficulty sharing data between different user groups have been identified. To address these challenges, this study proposes a multi-group data sharing network model based on Consortium Blockchain and IPFS for P2P sharing. This model uses a dynamic key encryption algorithm to provide secure data sharing, avoiding the problems associated with existing data transmission techniques such as key cracking or data leakage due to low security and reliability. Additionally, the model establishes an IPFS network for users within the group, allowing for the generation of data probes to verify data integrity, and the use of the Fabric network to record log information and probe data related to data operations and encryption. Data owners retain full control over access to their data to ensure privacy and security. The experimental results show that the system proposed in this study has wide applicability.

3.
JMIR Form Res ; 8: e53241, 2024 Apr 22.
Article in English | MEDLINE | ID: mdl-38648097

ABSTRACT

BACKGROUND: Electronic health records are a valuable source of patient information that must be properly deidentified before being shared with researchers. This process requires expertise and time. In addition, synthetic data have considerably reduced the restrictions on the use and sharing of real data, allowing researchers to access it more rapidly with far fewer privacy constraints. Therefore, there has been a growing interest in establishing a method to generate synthetic data that protects patients' privacy while properly reflecting the data. OBJECTIVE: This study aims to develop and validate a model that generates valuable synthetic longitudinal health data while protecting the privacy of the patients whose data are collected. METHODS: We investigated the best model for generating synthetic health data, with a focus on longitudinal observations. We developed a generative model that relies on the generalized canonical polyadic (GCP) tensor decomposition. This model also involves sampling from a latent factor matrix of GCP decomposition, which contains patient factors, using sequential decision trees, copula, and Hamiltonian Monte Carlo methods. We applied the proposed model to samples from the MIMIC-III (version 1.4) data set. Numerous analyses and experiments were conducted with different data structures and scenarios. We assessed the similarity between our synthetic data and the real data by conducting utility assessments. These assessments evaluate the structure and general patterns present in the data, such as dependency structure, descriptive statistics, and marginal distributions. Regarding privacy disclosure, our model preserves privacy by preventing the direct sharing of patient information and eliminating the one-to-one link between the observed and model tensor records. This was achieved by simulating and modeling a latent factor matrix of GCP decomposition associated with patients. RESULTS: The findings show that our model is a promising method for generating synthetic longitudinal health data that is similar enough to real data. It can preserve the utility and privacy of the original data while also handling various data structures and scenarios. In certain experiments, all simulation methods used in the model produced the same high level of performance. Our model is also capable of addressing the challenge of sampling patients from electronic health records. This means that we can simulate a variety of patients in the synthetic data set, which may differ in number from the patients in the original data. CONCLUSIONS: We have presented a generative model for producing synthetic longitudinal health data. The model is formulated by applying the GCP tensor decomposition. We have provided 3 approaches for the synthesis and simulation of a latent factor matrix following the process of factorization. In brief, we have reduced the challenge of synthesizing massive longitudinal health data to synthesizing a nonlongitudinal and significantly smaller data set.

5.
Bioscience ; 74(3): 169-186, 2024 Mar.
Article in English | MEDLINE | ID: mdl-38560620

ABSTRACT

The impact of preserved museum specimens is transforming and increasing by three-dimensional (3D) imaging that creates high-fidelity online digital specimens. Through examples from the openVertebrate (oVert) Thematic Collections Network, we describe how we created a digitization community dedicated to the shared vision of making 3D data of specimens available and the impact of these data on a broad audience of scientists, students, teachers, artists, and more. High-fidelity digital 3D models allow people from multiple communities to simultaneously access and use scientific specimens. Based on our multiyear, multi-institution project, we identify significant technological and social hurdles that remain for fully realizing the potential impact of digital 3D specimens.

6.
J Dermatolog Treat ; 35(1): 2338280, 2024 Dec.
Article in English | MEDLINE | ID: mdl-38569598

ABSTRACT

For individuals with atopic dermatitis (AD), interpreting scientific papers that present clinical outcomes including the Eczema Area and Severity Index (EASI) and Investigators Global Assessment may be difficult. When compared to tabulated data and graphs, images from before and after treatment are often far more meaningful to these patients that ultimately will be candidates for the treatment. This systematic review focused on determining the frequency of clinical image sharing in AD research.Conducted in accordance with PRISMA guidelines, the review concentrated on randomized controlled trials that investigated predefined and available systemic treatments for AD. The search was performed in the MEDLINE database for studies published from the inception until 21 December 2023.The review included 60 studies, encompassing 17,799 randomized patients. Across these studies, 16 images representing 6 patients were shared in the manuscripts, leading to a sharing rate of 0.3‰.The almost missing inclusion of patient images in clinical trial publications hinders patient understanding. Adding images to scientific manuscripts could significantly improve patients' comprehension of potential treatment outcomes. This review highlights the need for authors, the pharmaceutical industry, study sponsors, and publishers to enhance and promote patient information through increased use of visual data.


Subject(s)
Dermatitis, Atopic , Humans , Dermatitis, Atopic/drug therapy , Prospective Studies , Randomized Controlled Trials as Topic , Treatment Outcome , Administration, Cutaneous , Severity of Illness Index
7.
bioRxiv ; 2024 Mar 18.
Article in English | MEDLINE | ID: mdl-38562736

ABSTRACT

The tree-like morphology of neurons and glia is a key cellular determinant of circuit connectivity and metabolic function in the nervous system of essentially all animals. To elucidate the contribution of specific cell types to both physiological and pathological brain states, it is important to access detailed neuroanatomy data for quantitative analysis and computational modeling. NeuroMorpho.Org is the largest online collection of freely available digital neural reconstructions and related metadata and is continuously updated with new uploads. Earlier in the project, we released multiple datasets together yearly, but this process caused an average delay of several months in making the data public. Moreover, in the past 5 years, >80% of invited authors agreed to share their data with the community via NeuroMorpho.Org, up from <20% in the first 5 years of the project. In the same period, the average number of reconstructions per publication increased 600%, creating the need for automatic processing to release more reconstructions in less time. The progressive automation of our pipeline enabled the transition to agile releases of individual datasets as soon as they are ready. The overall time from data identification to public sharing decreased by 63.7%; 78% of the datasets are now released in less than 3 months with an average workflow duration below 40 days. Furthermore, the mean processing time per reconstruction dropped from 3 hours to 2 minutes. With these continuous improvements, NeuroMorpho.Org strives to forge a positive culture of open data. Most importantly, the new, original research enabled through reuse of datasets across the world has a multiplicative effect on science discovery, benefiting both authors and users.

8.
J Med Internet Res ; 26: e49445, 2024 Apr 24.
Article in English | MEDLINE | ID: mdl-38657232

ABSTRACT

BACKGROUND: Sharing data from clinical studies can accelerate scientific progress, improve transparency, and increase the potential for innovation and collaboration. However, privacy concerns remain a barrier to data sharing. Certain concerns, such as reidentification risk, can be addressed through the application of anonymization algorithms, whereby data are altered so that it is no longer reasonably related to a person. Yet, such alterations have the potential to influence the data set's statistical properties, such that the privacy-utility trade-off must be considered. This has been studied in theory, but evidence based on real-world individual-level clinical data is rare, and anonymization has not broadly been adopted in clinical practice. OBJECTIVE: The goal of this study is to contribute to a better understanding of anonymization in the real world by comprehensively evaluating the privacy-utility trade-off of differently anonymized data using data and scientific results from the German Chronic Kidney Disease (GCKD) study. METHODS: The GCKD data set extracted for this study consists of 5217 records and 70 variables. A 2-step procedure was followed to determine which variables constituted reidentification risks. To capture a large portion of the risk-utility space, we decided on risk thresholds ranging from 0.02 to 1. The data were then transformed via generalization and suppression, and the anonymization process was varied using a generic and a use case-specific configuration. To assess the utility of the anonymized GCKD data, general-purpose metrics (ie, data granularity and entropy), as well as use case-specific metrics (ie, reproducibility), were applied. Reproducibility was assessed by measuring the overlap of the 95% CI lengths between anonymized and original results. RESULTS: Reproducibility measured by 95% CI overlap was higher than utility obtained from general-purpose metrics. For example, granularity varied between 68.2% and 87.6%, and entropy varied between 25.5% and 46.2%, whereas the average 95% CI overlap was above 90% for all risk thresholds applied. A nonoverlapping 95% CI was detected in 6 estimates across all analyses, but the overwhelming majority of estimates exhibited an overlap over 50%. The use case-specific configuration outperformed the generic one in terms of actual utility (ie, reproducibility) at the same level of privacy. CONCLUSIONS: Our results illustrate the challenges that anonymization faces when aiming to support multiple likely and possibly competing uses, while use case-specific anonymization can provide greater utility. This aspect should be taken into account when evaluating the associated costs of anonymized data and attempting to maintain sufficiently high levels of privacy for anonymized data. TRIAL REGISTRATION: German Clinical Trials Register DRKS00003971; https://drks.de/search/en/trial/DRKS00003971. INTERNATIONAL REGISTERED REPORT IDENTIFIER (IRRID): RR2-10.1093/ndt/gfr456.


Subject(s)
Data Anonymization , Humans , Renal Insufficiency, Chronic/therapy , Information Dissemination/methods , Algorithms , Germany , Confidentiality , Privacy
9.
Clin Transl Oncol ; 2024 Apr 18.
Article in English | MEDLINE | ID: mdl-38635076

ABSTRACT

PURPOSE: This study has been focused on assessing the Open Science scenario of cancer research during the period 2011-2021, in terms of the derived scientific publications and raw data dissemination. METHODS: A cancer search equation was executed in the Science Citation Index-Expanded, collecting the papers signed by at least one Spanish institution. The same search strategy was performed in the Data Citation Index to describe dataset diffusion. RESULTS: 50,822 papers were recovered, 71% of which belong to first and second quartile journals. 59% of the articles were published in Open Access (OA) journals. The Open Access model and international collaboration positively conditioned the number of citations received. Among the most productive journals stood out Plos One, Cancers, and Clinical and Translational Oncology. 2693 genomics, proteomics and metabolomics datasets were retrieved, being Gene Expression Omnibus the favoured repository. CONCLUSIONS: There has been an increase in oncology publications in Open Access. Most were published in first quartile journals and received higher citations than non-Open Access articles, as well as when oncological investigation was performed between international research teams, being relevant in the context of Open Science. Genetic repositories have been the preferred for sharing oncology datasets. Further investigation of research and data sharing in oncology is needed, supported by stronger Open Science policies, to achieve better data sharing practices among three scientific main pillars: researchers, publishers, and scientific organizations.

10.
JMIR Med Inform ; 12: e49646, 2024 Apr 23.
Article in English | MEDLINE | ID: mdl-38654577

ABSTRACT

Background: The SARS-CoV-2 pandemic has demonstrated once again that rapid collaborative research is essential for the future of biomedicine. Large research networks are needed to collect, share, and reuse data and biosamples to generate collaborative evidence. However, setting up such networks is often complex and time-consuming, as common tools and policies are needed to ensure interoperability and the required flows of data and samples, especially for handling personal data and the associated data protection issues. In biomedical research, pseudonymization detaches directly identifying details from biomedical data and biosamples and connects them using secure identifiers, the so-called pseudonyms. This protects privacy by design but allows the necessary linkage and reidentification. Objective: Although pseudonymization is used in almost every biomedical study, there are currently no pseudonymization tools that can be rapidly deployed across many institutions. Moreover, using centralized services is often not possible, for example, when data are reused and consent for this type of data processing is lacking. We present the ORCHESTRA Pseudonymization Tool (OPT), developed under the umbrella of the ORCHESTRA consortium, which faced exactly these challenges when it came to rapidly establishing a large-scale research network in the context of the rapid pandemic response in Europe. Methods: To overcome challenges caused by the heterogeneity of IT infrastructures across institutions, the OPT was developed based on programmable runtime environments available at practically every institution: office suites. The software is highly configurable and provides many features, from subject and biosample registration to record linkage and the printing of machine-readable codes for labeling biosample tubes. Special care has been taken to ensure that the algorithms implemented are efficient so that the OPT can be used to pseudonymize large data sets, which we demonstrate through a comprehensive evaluation. Results: The OPT is available for Microsoft Office and LibreOffice, so it can be deployed on Windows, Linux, and MacOS. It provides multiuser support and is configurable to meet the needs of different types of research projects. Within the ORCHESTRA research network, the OPT has been successfully deployed at 13 institutions in 11 countries in Europe and beyond. As of June 2023, the software manages data about more than 30,000 subjects and 15,000 biosamples. Over 10,000 labels have been printed. The results of our experimental evaluation show that the OPT offers practical response times for all major functionalities, pseudonymizing 100,000 subjects in 10 seconds using Microsoft Excel and in 54 seconds using LibreOffice. Conclusions: Innovative solutions are needed to make the process of establishing large research networks more efficient. The OPT, which leverages the runtime environment of common office suites, can be used to rapidly deploy pseudonymization and biosample management capabilities across research networks. The tool is highly configurable and available as open-source software.

11.
Int J Eat Disord ; 2024 Apr 10.
Article in English | MEDLINE | ID: mdl-38597344

ABSTRACT

OBJECTIVE: To provide a brief overview of artificial intelligence (AI) application within the field of eating disorders (EDs) and propose focused solutions for research. METHOD: An overview and summary of AI application pertinent to EDs with focus on AI's ability to address issues relating to data sharing and pooling (and associated privacy concerns), data augmentation, as well as bias within datasets is provided. RESULTS: In addition to clinical applications, AI can utilize useful tools to help combat commonly encountered challenges in ED research, including issues relating to low prevalence of specific subpopulations of patients, small overall sample sizes, and bias within datasets. DISCUSSION: There is tremendous potential to embed and utilize various facets of artificial intelligence (AI) to help improve our understanding of EDs and further evaluate and investigate questions that ultimately seek to improve outcomes. Beyond the technology, issues relating to regulation of AI, establishing ethical guidelines for its application, and the trust of providers and patients are all needed for ultimate adoption and acceptance into ED practice. PUBLIC SIGNIFICANCE: Artificial intelligence (AI) offers a promise of significant potential within the realm of eating disorders (EDs) and encompasses a broad set of techniques that offer utility in various facets of ED research and by extension delivery of clinical care. Beyond the technology, issues relating to regulation, establishing ethical guidelines for application, and the trust of providers and patients are needed for the ultimate adoption and acceptance of AI into ED practice.

12.
JAMIA Open ; 7(2): ooae025, 2024 Jul.
Article in English | MEDLINE | ID: mdl-38617994

ABSTRACT

Objectives: A data commons is a software platform for managing, curating, analyzing, and sharing data with a community. The Pandemic Response Commons (PRC) is a data commons designed to provide a data platform for researchers studying an epidemic or pandemic. Methods: The PRC was developed using the open source Gen3 data platform and is based upon consortium, data, and platform agreements developed by the not-for-profit Open Commons Consortium. A formal consortium of Chicagoland area organizations was formed to develop and operate the PRC. Results: The consortium developed a general PRC and an instance of it for the Chicagoland region called the Chicagoland COVID-19 Commons. A Gen3 data platform was set up and operated with policies, procedures, and controls for a NIST SP 800-53 revision 4 Moderate system. A consensus data model for the commons was developed, and a variety of datasets were curated, harmonized and ingested, including statistical summary data about COVID cases, patient level clinical data, and SARS-CoV-2 viral variant data. Discussion and conclusions: Given the various legal and data agreements required to operate a data commons, a PRC is designed to be in place and operating at a low level prior to the occurrence of an epidemic, with the activities increasing as required during an epidemic. A regional instance of a PRC can also be part of a broader data ecosystem or data mesh consisting of multiple regional commons supporting pandemic response through sharing regional data.

13.
Cureus ; 16(3): e56193, 2024 Mar.
Article in English | MEDLINE | ID: mdl-38618347

ABSTRACT

In the ever-evolving landscape of biomedical research and publishing, the International Committee of Medical Journal Editors recommendations serve as a critical framework for maintaining ethical standards. By providing a framework that adapts to technological advancements, the International Committee of Medical Journal Editors recommendations actively shape responsible and transparent practices, ensuring the integrity of scientific inquiry and fostering global collaboration in the ever-evolving landscape of medical publishing. This editorial delves into key aspects of the latest changes in the International Committee of Medical Journal Editors recommendations, focusing on authorship, conflict of interest disclosure, data sharing and reproducibility, medical publishing and carbon emissions, the use of artificial intelligence, and the challenges posed by predatory journals within the realm of open access. It emphasizes the importance of new recommendations, which represent a beacon of ethical guidance in the ever-evolving domain of biomedical research and publishing.

14.
Exposome ; 4(1): osae003, 2024.
Article in English | MEDLINE | ID: mdl-38425336

ABSTRACT

The correlations among individual exposures in the exposome, which refers to all exposures an individual encounters throughout life, are important for understanding the landscape of how exposures co-occur, and how this impacts health and disease. Exposome-wide association studies (ExWAS), which are analogous to genome-wide association studies (GWAS), are increasingly being used to elucidate links between the exposome and disease. Despite increased interest in the exposome, tools and publications that characterize exposure correlations and their relationships with human disease are limited, and there is a lack of data and results sharing in resources like the GWAS catalog. To address these gaps, we developed the PEGS Explorer web application to explore exposure correlations in data from the diverse North Carolina-based Personalized Environment and Genes Study (PEGS) that were rigorously calculated to account for differing data types and previously published results from ExWAS. Through globe visualizations, PEGS Explorer allows users to explore correlations between exposures found to be associated with complex diseases. The exposome data used for analysis includes not only standard environmental exposures such as point source pollution and ozone levels but also exposures from diet, medication, lifestyle factors, stress, and occupation. The web application addresses the lack of accessible data and results sharing, a major challenge in the field, and enables users to put results in context, generate hypotheses, and, importantly, replicate findings in other cohorts. PEGS Explorer will be updated with additional results as they become available, ensuring it is an up-to-date resource in exposome science.

15.
BMC Med Res Methodol ; 24(1): 61, 2024 Mar 09.
Article in English | MEDLINE | ID: mdl-38461273

ABSTRACT

BACKGROUND: The provision of data sharing statements (DSS) for clinical trials has been made mandatory by different stakeholders. DSS are a device to clarify whether there is intention to share individual participant data (IPD). What is missing is a detailed assessment of whether DSS are providing clear and understandable information about the conditions for data sharing of IPD for secondary use. METHODS: A random sample of 200 COVID-19 clinical trials with explicit DSS was drawn from the ECRIN clinical research metadata repository. The DSS were assessed and classified, by two experienced experts and one assessor with less experience in data sharing (DS), into different categories (unclear, no sharing, no plans, yes but vague, yes on request, yes with specified storage location, yes but with complex conditions). RESULTS: Between the two experts the agreement was moderate to substantial (kappa=0.62, 95% CI [0.55, 0.70]). Agreement considerably decreased when these experts were compared with a third person who was less experienced and trained in data sharing ("assessor") (kappa=0.33, 95% CI [0.25, 0.41]; 0.35, 95% CI [0.27, 0.43]). Between the two experts and under supervision of an independent moderator, a consensus was achieved for those cases, where both experts had disagreed, and the result was used as "gold standard" for further analysis. At least some degree of willingness of DS (data sharing) was expressed in 63.5% (127/200) cases. Of these cases, around one quarter (31/127) were vague statements of support for data sharing but without useful detail. In around half of the cases (60/127) it was stated that IPD could be obtained by request. Only in in slightly more than 10% of the cases (15/127) it was stated that the IPD would be transferred to a specific data repository. In the remaining cases (21/127), a more complex regime was described or referenced, which could not be allocated to one of the three previous groups. As a result of the consensus meetings, the classification system was updated. CONCLUSION: The study showed that the current DSS that imply possible data sharing are often not easy to interpret, even by relatively experienced staff. Machine based interpretation, which would be necessary for any practical application, is currently not possible. Machine learning and / or natural language processing techniques might improve machine actionability, but would represent a very substantial investment of research effort. The cheaper and easier option would be for data providers, data requestors, funders and platforms to adopt a clearer, more structured and more standardised approach to specifying, providing and collecting DSS. TRIAL REGISTRATION: The protocol for the study was pre-registered on ZENODO ( https://zenodo.org/record/7064624#.Y4DIAHbMJD8 ).


Subject(s)
Information Dissemination , Research Design , Humans , Information Dissemination/methods , Consensus , Registries
16.
Alzheimers Dement ; 20(4): 2886-2893, 2024 Apr.
Article in English | MEDLINE | ID: mdl-38456576

ABSTRACT

INTRODUCTION: Federal policies and guidelines have expanded the return of individual results to participants and expectations for data sharing between investigators and through repositories. Here, we report investigators' and study participants' views and experiences with data stewardship practices within frontotemporal lobal degeneration (FTLD) research, which reveal unique ethical challenges. METHODS: Semi-structured interviews with (1) investigators conducting FTLD research that includes genetic data collection and/or analysis and (2) participants enrolled in a single site longitudinal FTLD study. RESULTS: Analysis of the interviews identified three meta themes: perspectives on data sharing, experiences with enrollment and participation, and data management and security as mechanisms for participant protections. DISCUSSION: This study identified a set of preliminary gaps and needs regarding data stewardship within FTLD research. The results offer initial insights on ethical challenges to data stewardship aimed at informing future guidelines and policies.


Subject(s)
Frontotemporal Lobar Degeneration , Humans , Frontotemporal Lobar Degeneration/genetics , Atrophy , Research Personnel
17.
J Am Med Inform Assoc ; 31(5): 1135-1143, 2024 Apr 19.
Article in English | MEDLINE | ID: mdl-38457282

ABSTRACT

OBJECTIVES: Clinical trial data sharing is crucial for promoting transparency and collaborative efforts in medical research. Differential privacy (DP) is a formal statistical technique for anonymizing shared data that balances privacy of individual records and accuracy of replicated results through a "privacy budget" parameter, ε. DP is considered the state of the art in privacy-protected data publication and is underutilized in clinical trial data sharing. This study is focused on identifying ε values for the sharing of clinical trial data. MATERIALS AND METHODS: We analyzed 2 clinical trial datasets with privacy budget ε ranging from 0.01 to 10. Smaller values of ε entail adding greater amounts of random noise, with better privacy as a result. Comparison of rates, odds ratios, means, and mean differences between the original clinical trial datasets and the empirical distribution of the DP estimator was performed. RESULTS: The DP rate closely approximated the original rate of 6.5% when ε > 1. The DP odds ratio closely aligned with the original odds ratio of 0.689 when ε ≥ 3. The DP mean closely approximated the original mean of 164.64 when ε ≥ 1. As ε increased to 5, both the minimum and maximum DP means converged toward the original mean. DISCUSSION: There is no consensus on how to choose the privacy budget ε. The definition of DP does not specify the required level of privacy, and there is no established formula for determining ε. CONCLUSION: Our findings suggest that the application of DP holds promise in the context of sharing clinical trial data.


Subject(s)
Biomedical Research , Privacy , Information Dissemination/methods , Consensus
18.
JMIR Form Res ; 8: e49822, 2024 Mar 20.
Article in English | MEDLINE | ID: mdl-38506894

ABSTRACT

BACKGROUND: New federal policies along with rapid growth in data generation, storage, and analysis tools are together driving scientific data sharing in the United States. At the same, triangulating human research data from diverse sources can also create situations where data are used for future research in ways that individuals and communities may consider objectionable. Institutional gatekeepers, namely, signing officials (SOs), are therefore at the helm of compliant management and sharing of human data for research. Of those with data governance responsibilities, SOs most often serve as signatories for investigators who deposit, access, and share research data between institutions. Although SOs play important leadership roles in compliant data sharing, we know surprisingly little about their scope of work, roles, and oversight responsibilities. OBJECTIVE: The purpose of this study was to describe existing institutional policies and practices of US SOs who manage human genomic data access, as well as how these may change in the wake of new Data Management and Sharing requirements for National Institutes of Health-funded research in the United States. METHODS: We administered an anonymous survey to institutional SOs recruited from biomedical research institutions across the United States. Survey items probed where data generated from extramurally funded research are deposited, how researchers outside the institution access these data, and what happens to these data after extramural funding ends. RESULTS: In total, 56 institutional SOs participated in the survey. We found that SOs frequently approve duplicate data deposits and impose stricter access controls when data use limitations are unclear or unspecified. In addition, 21% (n=12) of SOs knew where data from federally funded projects are deposited after project funding sunsets. As a consequence, most investigators deposit their scientific data into "a National Institutes of Health-funded repository" to meet the Data Management and Sharing requirements but also within the "institution's own repository" or a third-party repository. CONCLUSIONS: Our findings inform 5 policy recommendations and best practices for US SOs to improve coordination and develop comprehensive and consistent data governance policies that balance the need for scientific progress with effective human data protections.

19.
J Health Psychol ; : 13591053241239109, 2024 Mar 28.
Article in English | MEDLINE | ID: mdl-38549221

ABSTRACT

Qualitative research plays a pivotal role in health psychology, offering insights into the intricacies of health-related issues. However, the specificity of qualitative methodology presents challenges in adhering to standard open science principles, including data sharing. The guidelines to address these issues are limited. Drawing from the author's experience in conducting in-depth interviews with middle-aged and older adults regarding their sexuality, this article discusses various challenges in implementing data sharing requirements. It emphasizes factors like participants' reasonable reluctance to share in specific populations, the depth of personal information gleaned from comprehensive interviews, concerns surrounding potential data misuse both within and outside academic circles, and the complex issue of obtaining informed consent. A universal approach to data sharing in qualitative research proves impractical, emphasizing the necessity for adaptable, context-specific guidelines that acknowledge the methodology's nuances. Striking a balance between transparency and ethical responsibility requires tailored strategies and thoughtful consideration.

20.
JMIR Public Health Surveill ; 10: e52587, 2024 Mar 28.
Article in English | MEDLINE | ID: mdl-38546731

ABSTRACT

The COVID-19 pandemic revealed that data sharing challenges persist across public health information systems. We examine the specific challenges in sharing syndromic surveillance data between state, local, and federal partners. These challenges are complicated by US federalism, which decentralizes public health response and creates friction between different government units. The current policies restrict federal access to state and local syndromic surveillance data without each jurisdiction's consent. These policies frustrate legitimate federal governmental interests and are contrary to ethical guidelines for public health data sharing. Nevertheless, state and local public health agencies must continue to play a central role as there are important risks in interpreting syndromic surveillance data without understanding local contexts. Policies establishing a collaborative framework will be needed to support data sharing between federal, state, and local partners. A collaborative framework would be enhanced by a governance group with robust state and local involvement and policy guardrails to ensure the use of data is appropriate. These policy and relational challenges must be addressed to actualize a truly national public health information system.


Subject(s)
COVID-19 , Humans , COVID-19/epidemiology , Pandemics/prevention & control , Public Health , Sentinel Surveillance , Information Dissemination
SELECTION OF CITATIONS
SEARCH DETAIL
...